Fix duplicate arguments passed to dummy inputs in ONNX export by lewtun · Pull Request #16045 · huggingface/transformers

lewtun · 2022-03-10T13:02:48Z

What does this PR do?

This PR fixes:

a bug that was introduced in Add ONNX export for ViT #15658 where a preprocessor and tokenizer were being passed together to the generate_dummy_inputs() function during the ONNX export.
an oversight in the refactoring of the ONNX config for M2M-100

It also removes problematic TensorFlow integration tests, where the model implementation doesn't have parity with the PyTorch one (e.g. camembert-base is missing the causal LM head in TensorFlow). I'll address those issues in separate PRs as it involves touching the TensorFlow modeling files.

With these fixes, all slow ONNX tests now pass in all environments (only torch, only tensorflow, torch and tensorflow):

RUN_SLOW=1 python -m pytest tests/onnx/test_onnx_v2.py

cc @michaelbenayoun

HuggingFaceDocBuilderDev · 2022-03-10T13:07:37Z

The documentation is not available anymore as the PR was closed or merged.

LysandreJik

Looks good! Left two comments that should be applied 4 times each :)

LysandreJik · 2022-03-10T14:35:26Z

src/transformers/onnx/convert.py

        `Tuple[List[str], List[str]]`: A tuple with an ordered list of the model's inputs, and the named inputs from
        the ONNX configuration.
    """
+    from ..tokenization_utils_base import PreTrainedTokenizerBase


I think this can be a top-level import

LysandreJik · 2022-03-10T14:36:14Z

src/transformers/onnx/convert.py

            "The `tokenizer` argument is deprecated and will be removed in version 5 of Transformers. Use `preprocessor` instead.",
            FutureWarning,
        )
+        logger.warning("Overwriting the `preprocessor` argument with `tokenizer` to generate dummmy inputs.")


Maybe this can be an info as it's more additional information and not really an error

(warnings get displayed by default, info is displayed when users ask to have more info)

LysandreJik · 2022-03-10T14:36:41Z

src/transformers/onnx/convert.py

    import onnx
    import tf2onnx

+    from ..tokenization_utils_base import PreTrainedTokenizerBase


Same comment about top-level

LysandreJik · 2022-03-10T14:36:48Z

src/transformers/onnx/convert.py

            "The `tokenizer` argument is deprecated and will be removed in version 5 of Transformers. Use `preprocessor` instead.",
            FutureWarning,
        )
+        logger.warning("Overwriting the `preprocessor` argument with `tokenizer` to generate dummmy inputs.")


Same comment about logging level

sgugger

Good for me with Lysandre's comments! Thanks for working on this!

Fix duplicate arguments passed to dummy inputs in ONNX export

e627069

lewtun changed the title ~~Fix duplicate arguments passed to dummy inputs in ONNX export~~ [WIP] Fix duplicate arguments passed to dummy inputs in ONNX export Mar 10, 2022

lewtun added 2 commits March 10, 2022 14:32

Fix logging messages

0af46e6

Fix M2M100 ONNX config

d6e0361

lewtun changed the title ~~[WIP] Fix duplicate arguments passed to dummy inputs in ONNX export~~ Fix duplicate arguments passed to dummy inputs in ONNX export Mar 10, 2022

lewtun requested review from LysandreJik and sgugger March 10, 2022 13:56

LysandreJik approved these changes Mar 10, 2022

View reviewed changes

sgugger approved these changes Mar 10, 2022

View reviewed changes

lewtun added 5 commits March 10, 2022 16:12

Integrate reviewer comments

abe606d

Ensure we check PreTrained model only if torch is available

ee6a1e0

Remove TensorFlow tests for models without PyTorch parity

8f95586

Remove GPT-Neo from TF tests

c748f00

Remove GPT-2 from TF ONNX tests

6c28992

lewtun merged commit 6b09328 into master Mar 10, 2022

lewtun deleted the fix-onnx-dummies branch March 10, 2022 19:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix duplicate arguments passed to dummy inputs in ONNX export#16045

Fix duplicate arguments passed to dummy inputs in ONNX export#16045
lewtun merged 8 commits intomasterfrom
fix-onnx-dummies

lewtun commented Mar 10, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Mar 10, 2022 •

edited

Loading

Uh oh!

LysandreJik left a comment

Uh oh!

LysandreJik Mar 10, 2022

Uh oh!

LysandreJik Mar 10, 2022

Uh oh!

LysandreJik Mar 10, 2022

Uh oh!

LysandreJik Mar 10, 2022

Uh oh!

LysandreJik Mar 10, 2022

Uh oh!

sgugger left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

lewtun commented Mar 10, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Mar 10, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LysandreJik left a comment

Choose a reason for hiding this comment

Uh oh!

LysandreJik Mar 10, 2022

Choose a reason for hiding this comment

Uh oh!

LysandreJik Mar 10, 2022

Choose a reason for hiding this comment

Uh oh!

LysandreJik Mar 10, 2022

Choose a reason for hiding this comment

Uh oh!

LysandreJik Mar 10, 2022

Choose a reason for hiding this comment

Uh oh!

LysandreJik Mar 10, 2022

Choose a reason for hiding this comment

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

lewtun commented Mar 10, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Mar 10, 2022 •

edited

Loading